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~SYSTEM AND METHOD FOR TRANSACTION RECORDING AND PLAYBACK 

COPYRIGHT NOTICE 

A portion of the disclosure of this patent document contains material which is 
subject to copyright protection. The copyright owner has no objection to the facsimile 
rerroduction by anyone of the patent document or the patent disclosure, as it appears in the 
Pafjent and Trademark Office patent files or records, but otherwise reserves all copyright rights 
whatsoever. 

RELATED APPLICATIONS 
This application is related to the following patent applications, which are hereby 
incorporated by reference in their entirety: 

• Application serial no. 09/724,025 titled METHOD AND SYSTEM FOR 
PREDICTING CAUSES OF NETWORK SERVICE OUTAGES USING 
TIME DOMAIN CORRELATION, attorney docket no. 3882/3, filed 
November 28, 2000; 

• Application serial no. 09/9 1 0,676 titled IMPROVED EVENT DATABASE 
MANAGEMENT METHOD AND SYSTEM FOR NETWORK EVENT 
REPORTING SYSTEM, attorney docket no. 3882/7, filed July 20, 2001; and 

• Application serial no. 09/928,245 titled SYSTEM AND METHOD FOR 
TESTING MULTIPLE DIAL UP POINTS IN A COMMUNICATIONS 
NETWORK, attorney docket no. 3882/10, filed August 10, 2001. 

BACKGROUND OF THE INVENTION 
The invention disclosed herein relates generally to tracking a user's actions within 
an information system. More particularly, the present invention relates to tracing a user's 
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navigation between information resources within an information system to create a transaction 
data file and replay the steps contained therein to verify the level of service being offered by the 
server hosting the information resource. 

Internet search engines are employed by users to locate specific information 
5 resources on the Intemet, e.g., web pages and the interconnections between them. In order to 
catalog vast number of information resources on the Intemet, these search engines deploy 
automated applications referred to alternately as "spiders", "crawlers", or "bots". Applications 
such as these navigate to and catalog sites submitted to the search engine by the site owners as 
new or updated and, therefore, need to be cataloged. These applications may also be given a set 

f40 of seed sites to begin their process of navigation and cataloging. Spider apphcations get their 

G 

f U name firom the fact that they typically retrieve information resources form a plurality of sites in 



ru 



parallel. Spiders and bots may also build a catalog of sites by following all the hypertext links in 
a series of web pages until all the pages have been read. 



fn Spiders, crawlers, and bots are useful tools in cataloging the contents of web sites 

fU 

5 and other servers of information resources. These applications, however, only collect 

9 

information regarding the sites that they visit once. They are therefore only able to verify the 
availability of an information resource when the resource is cataloged. In order for a user to 
determine the availability of an infonjiation resource identified by a search engine, he or she 
must attempt to load the resource. Because no availability or service level information is 
20 provided, time is needlessly consumed testing hyperlinks instead of being directed to information 
resources that are indeed available when the user is attempting to access them. 

A similar problem is encountered by entities that manage a large number of 
servers where availability must be ensured. In order to guarantee availability of the information 
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resources hosted by the managed servers, a user must attempt to retrieve selected information 
resources on a regular basis. Altematively, a script or macro may be set to query each server on 
a regular basis to ensure that they are transmitting data. This alternative, however, fails to 
provide infomiation regarding the levels of service being offered by the managed servers. A 
third alternative involves a combination of the two methods, which is clearly a less than ideal 
solution to the problem. 

There is thus a need for a system and method whereby a user's transactions are 
automatically learned or recorded in order to maintain a history of a user's navigation 
transaction. Advantageously, this history may be used to retrace the steps of a user's transaction 
to validate availability of resources accessed by the user and the service levels being offered. 

BRIEF SUMMARY OF THE INVENTION 

The system and method of the present invention comprises functionality to record 
the steps of a user's navigation transaction and to subsequently play back those steps to 
determine service availability and levels. The present invention comprises a method for 
recording a user's steps in a navigation transaction by retrieving an information resource and 
calculating service level thresholds and the time required to retrieve the information resource. 
The service level thresholds and parameters regarding the information resource is recorded in a 
transaction data file as a step comprising the navigation transaction. 

The method of recording the steps of a user's navigation transaction may further 
comprise aborting the step of retrieving when a timeout threshold is exceeded. The step of 
recording may comprise recording a number of parameters including the address and port of the 
retrieved information resource and generating conditional logic used to instruct service software 
as to the correct service level code to return based on the service software's observed time to 
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retrieve the information resource. The step of generating conditional logic comprises conditional 
logic defining service levels of GOOD, MARGINAL, and FAILED. The step of recording may 
comprise recording the step information in a hierarchy, each step comprising an entry in the 
hierarchy with the transaction being the top level of the hierarchy. 

The present invention also comprises a method for playing back one or more steps 
executed by a user as part of a navigation transaction, the navigation transaction stored in a 
transaction data file. The method for playing back one or more steps executed by a user as part 
of a navigation transaction comprises identifying a first step within the transaction, executing the 
first step by attempting to retrieve an information resource identified by the step, and returning a 
level of service for a server hosting the information resource identified by the step. 

As a transaction data file may comprise multiple steps, the method may fiirther 
comprise identifying one or more subsequent steps within the transaction, executing the one or 
more subsequent steps by attempting to retrieve information resources identified by the 
subsequent one or more steps, and returning levels of service for servers hosting the information 
resources identified in the subsequent one or more steps. The time to execute the first step may 
be calculated and whereby the step of returning the level of service comprises calculating the 
level of service based on the amount of time required to execute the first step. 

The present invention also comprises a system for recording a user's steps in a 
navigation transaction and for playing back those steps in order to determine a level of service 
for each step. The system of the present invention comprises transaction recorder software. The 
transaction recorder software is operative to calculate service level thresholds and the time 
required to retrieve a requested information resource. The transaction recorder software is 
further operative to generate conditional logic used to instruct service software as to the correct 
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service level code to return based on the service software's observed time to retrieve the 
information resoxirce. The system also comprises service software operative to attempt to 
retrieve the requested information resource and retum a level of service defined by the 
conditional logic based on the amount of time required to retrieve the information resource. 

The invention ftirther provides a transaction data structure for facilitating 
recordation of a user's navigation steps within a transaction and playback of those steps. The 
transaction data structure may be structured as a hierarchical arrangement of data whereby the 
top level of the hierarchy defines the name and description of the transaction. Within the 
transaction is a second level of the data hierarchy where each step of the transaction is defined, 
e.g., each information resource retrieved. According to an altemative embodiment of the 
invention, the transaction data file is formatted according to an XML schema defining the 
various steps of the transaction and associated data. 

The data necessary to replay each step in the transaction is included within the 
transaction file. More specifically, these data include the information resource to retrieve, the 
action to perform, and the status to retum based on the time required to retrieve the information 
resource. In this manner, the information required to retrieve the resource, logic regarding how 
to define whether the retrieval is successfiil, and the level of service provided by the server 
hosting the information resource identified in the step. 



The invention is illustrated in the figures of the accompanying drawings which are 
meant to be exemplary and not limiting, in which like references are intended to refer to like or 
corresponding parts, and in which: 



BRIEF DESCRIPTION OF THE DRAWINGS 
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Fig. 1 is a block diagram presenting the transaction recording software in a 
network environment according to one embodiment of the present invention; 

Fig. 2 is a block diagram presenting an arrangement of software components for 
transaction recording according to one embodiment of the present invention; 

Fig. 3 is a screen diagram presenting a configuration of graphical controls for 
transaction recorder software according to one embodiment of the present invention; 

Fig. 4 is a transaction data file presenting the steps of a user's transaction 
according to one embodiment of the present invention; 

Fig. 5 is another view of a transaction data file presenting a single step of a user's 
40 transaction according to one embodiment of the present invention; 



fU Fig. 6 is a flow diagram presenting method of operating transaction recorder 



iU software in order collect a user's transaction steps according to one embodiment of the present 

4 

invention; and 



ru 



Fig. 7 is a flow diagram presenting a method of using a transaction file recorded 

ru 

5 by the transaction recorder software to verify the availability of an information resource 
f * according to one embodiment of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

With reference to Figs. 1 through 7, embodiments of the invention for transaction 
recording and playback are presented. In a networked computing environment, for example, the 
20 Intemet, a number of computing devices 102, 1 10, 1 16, and 122 are interconnected through one 
or more networks 130. Through the use of the network 130 and associated hardware and 
software components that comprise the network infi-astructure, e.g., routers and switches, the 
computing devices are capable of transmitting ^disparate types of data between themselves. 
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The World Wide Web (WWW) presents one example of a networked computing 
environment. According to the World Wide Web Consortium (W3C), the World Wide Web 
encompasses "the xmi verse of network accessible information, an embodiment of human 
knowledge". More specifically, the WWW comprises all the information resources and users on 
the Internet that are using the Hypertext Transfer Protocol (HTTP) to exchange information, 
typically in the form of HTML web pages. Indeed, the WWW, and the Intemet in general, may 
be used to pass many kinds of disparate information resources between requesting clients and 
responsive servers. 

A computing device 102 is provided that is used to navigate between servers 1 10 
1^0 and 116 located on the network providing information in response to requests. The computing 

device 102, which may be any type of personal, portable or handheld computing device, 
y comprises transaction recorder software 106 that is used to generate and maintain a record of the 
servers 110 and 116 accessed by the client 102, Advantageously, the transaction recorder 
software 106 may incorporate one or more software components (not pictured) that allow the 



fil 

vf - 



S-5 ! 



ru 

' 5 user to issue requests for information resources to servers accessible over the network 130, the 



U transaction recorder 106 software itself may also be retrieved off a network 130 and installed into 
the computing device's 102 local memory.. Alternatively, the transaction recorder software 106 
may be executed in conjunction and communication with web browser software, such as 
MICROSOFT INTERNET EXPLORERS", whereby information requests, as well as information 
20 returned to the browser in response to these requests, is analyzed and recorded by the transaction 
recorder software 106. The transaction recorder software 106 is used to analyze the retrieved 
information resources and record data that includes, but is not limited to, the title of the retrieved 



7 



Express Mail No.: EK510832775US 



3882/12 



information resource, the server or page where the use navigated from, the command passed to 



the server, and the time required to retrieve and render the information resource for presentation 



to the user. 



Although there are many types of servers that comprise functionality to deliver 
5 information according to the Hypertext Transfer Protocol (HTTP), these servers 1 10 and 1 16 



generally distribute either static 1 10 or dynamic 116 information. Servers that transmit static 



information 1 10 comprise a document server 1 12, e.g., a web server, and a collection of one or 



more static documents 1 14, e.g., HTML or XML docmnents, for distribution upon request. A 



client 102 requests a document by providing the document's uniform resource locator (URL) 

.f40 identifying the server containing the document 1 10 as well as the document itself 114. 

O 

ry Accordingly, the transaction recorder software 106 executing on the requesting client 102 makes 

i v a note of the requested transaction and associated parameters, e.g., time to load the information 

■ resource, which is either appended to an existing or newly created transaction data file 108. 



fl I Other information servers 116 dynamically generate pages of information in 

ru 

«p5 response to information requests. These pages, for example, may be tailored to contain 
f information pertaining to only certain topics or to the profile of the user making the information 
request. The document server software 118, e.g., a web server, dynamically generates a URL to 



retrieve content, typically stored in a database 120 or other data store, and prepare the page at the 



time it is requested. The transaction recorder software 106 maintains the dynamically created 



20 URLs used to reference these information resources so that the resource may be recreated and the 



document accessed at a point in the future. As is the case when retrieving static information 114 



from a document server 112, the transaction recorder software 106 executing on the requesting 



cHent 102 makes a note of the requested transaction and associated parameters, e.g., time to load 
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the information resource, the action performed by the transaction software, etc. This transaction 
information is either appended to an existing or newly created transaction data file 108. 

The transaction recorder software 106 generates one or more transaction data files 
108 as the user navigates between information resources or pages hosted by servers 110 and 116 
located on a network 130, each file containing a sequence of information resources retrieved 
from the network and parameters associated therewith. These transaction data files 108 may be 
transmitted over the network 130 to a service server 122. The service server 122 comprises 
service software 126 that is used to test the availability of servers 110 and 1 16 on the network 
130. Altematively, the computing device executing the transaction software 102 may directly 
upload the transaction data file to the service server 122 in order to simplify integration of the 
two. By supplying the address of the service server 122, e.g., an IP address, the transaction 
recorder software may perform a direct transfer by utilizing HTTP. 

The service software 126 loads a transaction data file 128 received from a client 
device 102 and proceeds to "playback" the information contained therein. For example, suppose 
a transaction data file 108 comprises information indicating that the user visited three web sites 
and requested several pages of information from each site. As is explained in greater detail 
herein, the transaction data file 108 comprises resource and associated parameter information as 
collected by the transaction recorder software 106. Using the data contained in the transaction 
data file 108, the service software 126 attempts to access and retrieve each piece of information 
that the transaction data file 108 indicates that the user retrieved. In this manner, the service 
software 126 plays back the user's actions to ensure that all the retrieved resources are available 
on the network. Moreover, the service software 126 uses the data in the transaction file to 
calculate the level of service being offered by each server 110 and 116. 
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One advantageous application of the invention is whereby a user loads 



information into the service software from a variety of managed servers 110 and 116 that is 
captured by the transaction recorder software 106 in a transaction data file 108. Using the 
transaction data file 108 and service software 126, the user may ensure, at any point in time, that 
the managed servers 1 10 and 1 16 are available on the network 130. As is explained in greater 
detail herein, the service software 126 calculates a level of service for each server hosting an 
information resource retrieved by the user as defined by the steps in the transaction data file. The 
service level is referred to as discrete value classification (DVC) and, according to one 
embodiment, may be set to GOOD, MARGINAL, or FAILURE. 



comprises part of the NETCOOL® Internet Service Monitor package for network service 
monitoring developed by Micromuse Inc. As described fiirther in the above referenced patent 
applications, incorporated herein by reference in their entirety, the ISM provides real-time fault 
management and service assurance. Network probes and monitors provided by the ISM 
collect event information from popular network environments such as SNMP and non- 
SNMP management applications, voice or data networking equipment, Internet and wide- 
area services, and computer systems and databases. Events are monitored, e.g., common 
network occurrences such as alerts, alarms, or other faults, which may be reported in 
customizable graphical and text formats allowing network operators to graphically see 
what is happening within the network in real-time. The addition of the service software 
to the ISM allows a user to define one or more transactions for playback at a later time 
with an indication of the service level being provided by each step within the transaction. 



According to one embodiment of the invention, the service software 126 
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Turning to Fig. 2, the software components that comprise the transaction recorder 
software introduced in Fig. 1 are presented in greater detail. Software components 204, 206, 
210, 214 comprising the transaction recorder application 202 are executed by a programmable 
digital microprocessor (not pictured) in order to record the information resources retrieved by a 
user, e.g., HTML web pages, and parameters associated with those resources. These software 
components include an address bar 204, a rendering component 206, an event handler 210, and a 
history component 214. Alternatively, the transaction recorder application 202 may comprise 
fewer software components. According to this embodiment, the transaction recorder application 
202 interacts with a web browsing program whereby fiinctionality, such as HTML rendering and 
event handling, is provided by the browser through inter-component conmiunication 
accomplished via COM, EJB, or other similar component commimication model. For example, 
using Microsoft OLE (Object Linking and Embedding) mechanisms, a host application (the 
transaction recorder application) may tum off menus and toolbars of the guest component (e.g., 
Microsoft Intemet Explorer) and supply its own UI as well as intercept context menus and 
message boxes generated the by guest component. 

According to one preferred embodiment, a user interacts with the transaction 
recorder application 202 through a graphical user interface in order to supply the location of 
desired resources and view the retrieved resuhs. Using graphical controls presented to the user 
on a display device, the user supplies a URL for a desired information resource in the address bar 
component 204. The formulated resource request is passed to the event handler 210 using one of 
the inter-component communication methods well known to those skilled in the art. The event 
handler component 210 is responsible for receiving events and acting on events generated by the 
other software components of the application 202. 
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A rendering component 206 is provided to receive information and present it to 
the user on a display device. The rendering component 206 receives markup or other data and is 
operative to render the resultant page. For example, where the information resource is an HTML 
web page, the rendering component reads each markup tag and associated data comprising the 
page in order to present the page ta the user. Information resources, however, mat be written in a 
variety of markup languages and comprise data other than ASCII text, for example, XML, 
VRML, SGML, GIF, JPEG, etc. 

Each information resource typically comprises a MIME header that is used by the 
rendering component to select an appropriate "player" appUcation for the type of data the header 
indicates. The MIME header allows the rendering component to display a wide variety of 
content types in additional to providing functionality that allows it to render content types not 
presently developed. According to some embodiments, the rendering component is a web 
browser that is not integrated within the transaction recorder application. Instead, the transaction 
recorder application uses inter-component communication to take advantage of the rendering 
functionality provided by the web browser. 

Upon receiving a request for a resource from the address bar 204, the event 
handler attempts to initiate communication with the target server that contains the desired 
resource. According to some embodiments, the event handler 210 may work with other 
components and/or applications in order to retrieve the resource. The event handler 210 issues 
an information request to the target server for the information resource supplied in the address 
bar 204. The target server receives the request and begins to transmit the information resource, 
which is received by the transaction recorder application 202 and passed to the rendering 
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component 206. Where the target device or information resource is unavailable, a page of 
information indicting this result is passed to the rendering component. 

The event handler 210 is also operative to receive an alert indicating that the 
information resource has been received, rendered, and navigation is complete. The information 
5 resource is examined and information is collected regarding the rendered information resource. 
Information captured includes, but is not limited to, the visited URL, the time required to 
download and render the complete information resource, the method used to get the page, any 
parameters passed to retrieve the page, e.g., POST parameters. 

The retrieved information regarding the information resource is recorded in a 
fiO transaction structure 212, which is a transient storage structure used to maintain the transaction 

toe? 

fU information before it is written out to a persistent storage device. Each step in a transaction 

^ created using the transaction recorder application 202 is assigned a name that is appended to the 

ry 

''J" start of the new transaction in order to identify the transaction. 

fy According to embodiments of the invention, the transaction structure 212 is a 

ru 

=3 5 nested hierarchy of steps of navigation requests for information resources performed by the user. 
At the top level of the hierarchy is the name of the transaction. Following are the information 
resources retrieved by the user, e.g., the web sites the user navigates to and requests pages from. 
Nested in the hierarchy one level below each recorded information resource is data regarding the 
information request. At a minimum, the transaction recorder application 202 maintains within 
20 the transaction structure 212 the information resource requested, the action performed, and the 
amount of time required to retrieve the information resource. The service software uses the 
information exported from the transaction structure 212 by the transaction recorder software 202 
to determine whether the recorded information resources are available. 
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Also provided as part of the transaction recorder application 202 is a history 



component 214. As the user executes steps within a transaction, e.g., information resources are 



navigated to and retrieved, information regarding the step is appended to the transaction structure 



212. The addition of each entry to the transaction structure 212 triggers an event that updates the 



5 history component to reflect the additional step that is added to the transaction structure. The 



history component 214 may be presented to the user as a graphical control on a display device. 



The GUI control may be a tree UI control that presents the sequence of information resources 
retrieved and data regarding each information resource in a hierarchical fashion; the data being 
retrieved from the transaction structure 212. Advantageously, where the history component 214 
f40 is a graphical control, an input device may be used to select individual entries from the history 
flJ component 214 for removal, thereby instructing the transaction application 202 to remove the 

W transaction entry from the transaction structure 212. 

fU 

' The transaction recorder application 202 is capable of exporting the contents of 

fU the transaction stmcture in a variety of formats 216, 218, and 220. Data for the steps that are 



==3 5 held in the transaction structure 212 may be written to a storage device as an ASCII text 

docvmient 216. The user is capable of modifying any value in the text file through the use of a 



standard text editor or word processor capable of reading ASCII text. In order to support 



languages other than those based on the Roman alphabet, the text docimient 216 may be output 



using Unicode character set. 



20 The transaction recorder application 202 may also output the contents of the 



transaction structure 212 in a native binary format used by the application 202. Outputting files 



according to this format 218 allows the file to be loaded back into the transaction structure 212 at 



a later point and continue to add steps to the structure 212. Likewise, the file 218 may be loaded 
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and transactions removed from the structure 212. The application may also be supplied with 
formatting details regarding proprietary file formats for use with specific applications and output 
the contents of the transaction structure 212 according to the proprietary file format 220. 

A screen diagram presenting one embodiment of the transaction recording 
application illustrated in Figs. 1 and 2 is introduced in Fig. 3. As described in the previous 
figures, the transaction recorder application 302 is divided into a number of control areas 304, 
306, 308, 322, and 324. Information resources, e.g., HTML web pages server over the Intemet 
by a web server, may be loaded by supplying the address of the resource to the transaction 
application via the address bar 304, which is a graphical control for interacting with the address 
bar software component presented earlier. When a valid address is supplied through the address 
bar 304, the information resource is retrieved and rendered by the rendering component for 
presentation on the display device within a rendering window 306. Also provided are standard 
web browser controls 322 that assist in navigation and menu controls 324 that allow the user to 
customize the application's 302 settings. 

According to some embodiments of the invention, the address bar 304, rendering 
window 306, browser 322 and menu 324 controls may be provided by hooks to Microsoft 
Intemet Explorer. The Intemet Explorer component MSHTML follows the standard OLE 
mechanisms for negotiating with the container for menu merging and displaying toolbars. A host 
of MSHTML, therefore, can turn off the MSHTML menus and toolbars and supply its own user 
interface with the underlying software components or use the supplied interface as shown by the 
use of standard Intemet Explorer controls 304, 306, 322, 324. 

As the user navigates between information resources using the address bar 304, 
navigation controls 322, and hyperlinks presented in the rendering window 306, the navigation 
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steps are written to the transaction structure and displayed to the user through the history 
component's user interface 308. The history UI 308 displays each step included in the current 
transaction structure with the name of the current transaction structure presented as top level of a 
hierarchy of transactions 310. Lying one level beneath the transaction structure name 310 are 
one or more steps 312 that comprise the transaction structure as manifested by the users 
navigation activities. 

As described above, the transaction structure maintains information regarding 
each step within the transaction, which is displayed by the history component 308 within the 
hierarchy beneath the step that the information is associated with 314, 316, 318, and 320. 
Transaction information presented to the user 308 in this embodiment includes URLs both for 
the "From" and "To" information sources, 314 and 316 respectively, that the user is navigating 
between. In the present example, the user is navigating from the root page of the New York 
Times web site to the root page of the ZDNet web site. Also recorded and presented is the time 
3 1 8 in seconds that was required for the requested page to be retrieved and rendered. Although 
the load time 318 is presented here in seconds, other time measurements are contemplated by the 
invention, e.g., miUiseconds. The last data item displayed by the history component 308 is the 
method used to retrieve the requested page 320, e.g., GET ALL or POST. Although not 
presented, the history component 308 may display POST parameters and other data associated 
with the transaction as maintained by the transaction structure. While the standard HTTP 
requests are GET and POST, the intemal convention GET ALL is used by the transaction 
recorder software to issue instructions to GET the page and GET each element on the page. 

Turning to Fig. 4, the text of an exemplary exported transaction data file is 
presented. As described above, the exported transaction file is preferably structured as a 
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hierarchical arrangement of data. The top level of the hierarchy 402 defines the name and 
description of the transaction. Within the transaction is a second level of the data hierarchy 
where each step of the transaction is defined 404, 406, 408, 410. According to an altemative 
embodiment of the invention, the transaction data file is formatted according to an XML schema 
defining the various steps of the transaction and associated data. 

Reading the transaction data file as generated by the transaction recorder software, 
the transaction recording software writes a new transaction entry 402 to the transaction data file 
when a new file is created. After the file is initialized, the software adds a new step to the file 
indicating that the user navigates to the web site for the New York Times 404, which is written to 
the data file with parameters regarding the step. The parameters written to the exported data file 
and their use are explained in greater detail in Figs. 5 and 7. The transaction recording software 
tracks that the user navigates to the Slashdot web site and adds an associated step entry to the 
transaction data file 406. The software also tracks the user as he or she navigates and retrieves 
additional information resources, e.g., web pages, firom the Slashdot web site, 408 and 410. 
According to some embodiments, the order in which the recorded steps are added or arranged 
within the transaction data file is irrelevant. 

As can be seen by the data comprising an individual step within the 
transaction 412, and as is explained in greater detail herein, all data necessary for the 
service software to replay the step is included within the transaction file. More 
specifically, these data include the information resource to retrieve, the action to perform, 
and the status to return based on the time required to retrieve the information resource is 
included 412. In this manner, the service software has the information required to 
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retrieve the resource and logic regarding how to define whether the retrieval is successful 
and the level of service provided by the server hosting the information resource identified 
in the step. 

A breakdown of the elements comprising one step of a transaction is 
presented in Fig. 5. As shown in Fig. 4, an addstep flag 404 identifies the new step entry. 
This value is followed by the transmission method that is used to retrieve the information 
resource 502, in this example HTTP. Another exemplary transmission method would be 
SHTTP or Secure Hypertext Transmission Protocol. This value is followed by the 
address of the server hosting the requested information resource and the actual 
information resource to retrieve 506, along with the port on which the information 
resource is available. Here, the server address is www.nytimes.com (a web server 
transmitting information resources according to HTTP), the requested information 
resource is "/" or the default root page, and the port number is 80. It should also be noted 
that, according to this embodiment, the quotation mark character is used to pass the value 
of a space to the service software upon playback, as spaces are otherwise used to delimit 
values. 

As is explained in greater detail herein, a timeout threshold is supplied 510, set to 
a value of 30 seconds in this example, in order to prevent the service software from waiting 
indefinitely for an information resource to be retumed. Also recorded is the command issued by 
the transaction software in order to retrieve the information resource 510. Here, the value 
GET ALL indicates that the software should collect the page and all elements loaded on the page. 
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As opposed to the quotation marks occurring as part of strings, thereby indicating a value of a 
space within the string, the four quotation marks 514 on the first line of the step represent 
authentication parameters that are required to be passed to certain severs for access. From left to 
right, the four sets of quotation marks hold the following values respectively: usemame, 
password, form name (where there is a form to identify on the page), and a regular expression 
value (a string used to evaluate against a retrieved information resource to ensure the proper 
document is retumed). According to this example, none of these values 5 14 were required by the 
user to retrieve the default root page fi-om the server www.nytimes.com and are therefore not 
required by the service software. 

V 

The first value contained on the second line of the step comprises the title of the 
retrieved information resource 516, a form flag 518 (used to define parameters for form fields 
where the retrieved information resource comprises a form), and a HEAD parameter 520. The 
HEAD parameter 520 is required to inform the server hosting the information resource as to what 
type of application is performing the request and may be used to alter the information resource 
retumed by the server. When played back by the service software, data is transmitted indicating 
that the software is of the type "Mozilla/4.0 (compatible; "MSIE"5.0)". Once again, the 
quotation marks forming part of the string are evaluated as spaces. 

The balance of the information comprising the step consists of DVC logic 522 
that is used by the service software to determine the level of service being offered by the server 
hosting the information resource that is the object of the step. As a threshold matter, the DVC 
codes define three levels of service at which a server may be operating: GOOD, MARGINAL, or 
FAILED. The DVC logic 520 presented in Fig. 5 is reproduced below in Table 1 : 
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DVC GOOD 4 1 FAILED 3 status NEQ 200 status NEQ 301 status NEQ 302 2 FAILED 1 
totalTime GT 26 3 MARGINAL 1 totalTime GT 25 4 FAILED 1 regexpStatus EQ FAILED 

Table 1 

The value GOOD indicates to the service software that this is the default value to return if the 
following gates all evaluate to false. The value "4" indicates that the DVC clause comprises four 
sub-clauses. These sub-clauses are each identified by a sub-clause number. The first sub-clause, 
identified by the value "1" situated between the value "4" and "FAILED", retums a DVC code of 
"FAILED" if any one of the next three conditions are true: the status code retumed by the server 
is not equal to 200, 301, or 302. 

The second sub-clause, identified by the value "2" situated between the value 



Q 
D 

f U "302" and "FAILED", indicates that a "FAILED" DVC code should be retumed by the service 
iHO software where the total time to retrieve the requested information resource is greater than 26 



seconds. The value 1 instructs the service software that there is only one condition associated 
with this sub-clause. The third sub-clause instructs the service software that a "MARGINAL" 
DVC code should be retumed where the total time to retrieve the request information resource is 
l'*^ greater than 25 seconds. Here again, the value 1 instracts the service software that there is only 
1 5 one condition associated with this sub-clause. The final clause indicates that a "FAILED" DVC 
code is to be retumed where the variable regexpStatus is equal to "FAILED". 

The DVC logic presented in Table 1 may be explained using the pseudocode of 

Table 2: 

if ( (status o 200) and (status o 301) and (status o 302)) then 

DVC = FAILED 
else if (totalTime > 26) then 
DVC = FAILED 
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else if (totalTime > 25) then 

DVC = MARGINAL 
else if (regexpStatus = FAILED) then 

DVC = FAILED 

end if 

Table 2 

According to the DVC logic pseudocode presented in Table 2, "status" is set to the code returned 
by the sever when the step is played back by the service software. The value "totalTime" is set 
to the time taken by the service software to retrieve the whole information resource and 
"regexpStatus" is a Boolean value indicating whether a regular expression supplied by the user 
when the step was recorded is present in the retrieved information resource. 

According to one embodiment of the invention, the threshold values associated 
with each of the DVC codes in the step is calculated using the equation of Table 3: 

Time = MAX(AX+B, min) 
Table 3 

The transaction recorder software records the total time required to retrieve the information 
resource, which is substituted for the variable "X" in the equation of Table 3. This is the time 
global required to load the whole page where the information resource is a web page. The 
service software, however, may expect the time as a fimction of the cumulative time required to 
retrieve each element comprising the information resource. 

A coefficient value, "A", is introduced to "de-parallelise" the retrieve time. In 
other words, the time it would take to retrieve all elements in a serial, rather than parallel, 
manner. Where service software expects a global time value, the coefficient is set a value of one. 
The value of "B" is available to add a time offset to take into account any delays associated with 
the service software loading the transaction and initiating the request. As this delay is becomes 
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shorter, the value of "B" is set to a correspondingly small value. Finally, the value of the "min" 
parameter defines a minimum value under which the value should not be set. For example, we 
may assume that if the global response is less than ten seconds, then a site qualifies as GOOD. 
These DVC values associated with each DVC code within the DVC logic may be calculated 
5 when the information resource is retrieved. Altematively, the transaction recorder software 
calculates the DVC values upon export. 

Fig. 6 presents a flow diagram illustrating a method of operating the transaction 
recorder software presented in Figs. 1 though 3 in order to generate a transaction data file. Using 
an input device, such as a keyboard or virtual keyboard presented on a display device and a 
^5 0 pointing device, a user supplies an address of an information resource for retrieval, step 602. The 
jir! user may supply the address by entering it in the address bar or by selecting a hyperlink fi-om 

^ m 

l^y within an HTML document. According to one embodiment, the address is a URL for a HTML 

f U document located on the Internet. Altematively, the address may be a URL for an XML 

Jj: document or an image file. Indeed, the present invention contemplates tracking requests for all 

^1 5 manner of digital media types and formats. An event is generated indicating that an address has 

3 

U been supplied to the system, which causes the software to attempt open a communication channel 
with the indicated server and retrieve the information resource, step 604. 

A check is performed to determine whether the user has supplied a valid address 
for retrieval of the information resource, step 606. Where the address supplied by the user is 
20 invalid, step 606, the software presents an error message to the user indicative of this fact, step 
608. Another check is performed to determine if the user has supplied an additional address, step 
624, for example, corrected the malformed address supplied in step 602. If no additional address 
is supplied, the process concludes, step 626. Where the user supplies a new address, program 
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flow returns to step 604 where the software attempts open a communication channel with the 
indicated server and retrieve the information resource. 

Where the check executed at step 606 returns a value of true indicating the 
resource address is valid, e.g., the software successfiiUy opened a communication channel with 
5 the target server and located the desired information resource, the resource is retrieved from 
across the network, step 612. The requesting client receives the retrieved information resource, 
e.g., the client executing the transaction recorder software, and renders it on the client's display 
device, step 614. According to some embodiments of the invention, the information resource is 
composed of a plurality of elements, each residing on one or more disparate servers on the 

tjo network. The transaction recorder software may retrieve these elements that comprise the 

C3 

ry desired information resource in parallel in order to reduce the time required to retrieve the 
ly complete information resource. 

A calculation is performed, step 616, to determine the duration of time required to 
iTi retrieve the requested information resource, step 612, and render it on the display device, step 

ru 

614. The time calculation is recorded in the transaction structure, step 616. In addition to the 

I : 

time calculation, step 616, resource information is retrieved and recorded, step 618. The 
recorded information includes, but is not limited to, the previous information resource retrieved 
(the "From" address), the address of the current information resource (the "To" address), the time 
required to retrieve and render the information resource, the action performed (e.g., GET ALL, 
20 POST, etc.). The data written to the transaction structure is also used to populate the history 
component and associated UI control, step 620. The data presented by the history component 
provides feedback to the user regarding information collected about the retrieved information 
resource. 
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A check is performed to determine if the user has supplied an additional 
information resource to which to navigate, step 624. If the user has supplied an address for an 
additional information resource, for example, by selecting a hyperlink off of an information 
resource previously retrieved where the resource is an HTML document, program flow retums to 
5 step 604 where the transaction recorder software attempts to open a communication channel and 
retrieve the desired resource. Where the user has not supplied the address of an additional 
information resource for retrieval, program flow terminates, step 626. 

Fig. 7 presents one embodiment of a process for using the service software to play 
back the steps comprising a transaction data file. The sever executing the service software 

1^0 receives a transaction data file and proceeds to load it into memory for playback, step 702. 

P 

fU Typically, a user at a computer executmg the transaction recorder software records the steps of a 
transaction and transmits the completed document to a local storage device attached to the server 

fU 

^ executing the service software for playback. The transaction data files, however, may reside on a 

fU network file device and be retrieved by the service software for playback. Ahematively, the 

FU 

*3 5 service software and transaction recorder software may be executed by the same 

p 

'^^ server/workstation and share a common storage device. Other permutations are also 
contemplated by the invention. 

The service software starts the transaction by attempting to open a communication 
channel and retrieve the information resource listed in the first step of the transaction data file, 
20 step 704. Alternatively, the user may indicate a step as a starting point for playback. A check is 
made to determine if a timeout threshold is exceeded without retrieving the information resource, 
step 706. According to various embodiments, the timeout threshold be may be set either in the 
transaction data file by the transaction recorder software or by the user through the interface 



24 Express Mail No.: EK510832775US 



3882/12 

provided by the server software. The use of a timeout mechanism advantageously prevents the 
service software from crashing by waiting indefinitely to retrieve an information resource. 

If the information resource is not retrieved before the timeout threshold expires, 
step 706, a check is preformed to determine if additional steps are contained within the 
transaction, step 716. Where additional steps are contained within the transaction being played 
back, step 716, program flow returns to step 704 where the service software plays the next step in 
the transaction. If there are no additional steps to perform in the transaction being played back, 
an additional check is performed in order to determine if additional transactions are scheduled for 
playback, step 718. The service software provides fimctionality that allows an administrator or 
other operator of the software to schedule a set or batch of transaction data files for processing, 
as opposed to processing each one manually. 

According to one embodiment, the service software performs a check within an 
indicated directory on a storage device to determine if any transaction data files exist that have 
not been processed, step 718. If so, processing returns to step 702 and the process is repeated 
until all transaction data files in the directory have been processed. Alternatively, a flag may be 
set at run time indicating the names and locations of the files to be processed. Batch processing 
in this manner is usefiil in situations where an entity is executing the service software on behalf 
of a nimiber of disparate clients or entities. Where there are no additional transactions to process, 
the process concludes, step 720. 

Returning to step 706, a check is performed to determine if the timeout threshold 
has been exceeded. Where the information resource is retrieved before the timeout threshold 
expires, step 706, the service software calculates the elapsed time required to retrieve the 
information resource, step 710. Based on the calculated time required to retrieve the information 
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resource, step 710, the DVC logic is executed, step 712, as explained by the analysis of the 
exemplary DVC logic presented in Fig. 5. Using the DVC logic comprising the step presently 
being executed, the service software determines the appropriate DVC code to return, step 714, 
and, therefore, the status of the server hosting the retrieved information resource. The DVC code 
5 for the step is preferably written to an output file on a persistent storage device when returned, 
step 714, although it may also be written to transient memory and presented on a display device. 

The service software returns the DVC code for the step being played back, step 
714, and a check is performed to determine if the transaction comprises additional steps 
remaining to be played back, step 716. Where additional steps remain to be played back, step 
f^i O 716, processing returns to step 704 where the software plays the next step. If all steps in the 
fU transaction have been processed, step 716, a check is performed to determine if any additional 
iU transactions are slated for playback, step 718. Processing retums to step 702 where an additional 
transaction remains to be played back and the process is repeated for each step in the transaction. 
?y If all transactions have been processed, step 718, the process of playing back the transaction data 

ru 

file concludes, step 720. 

a 

While the invention has been described and illustrated in connection with 
preferred embodiments, many variations and modifications as will be evident to those skilled in 
this art may be made without departing from the spirit and scope of the invention, and the 
invention is thus not to be limited to the precise details of methodology or construction set forth 
20 above as such variations and modification are intended to be included within the scope of the 
invention. 
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